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In this article, we develop a relational algebra for metadata integration, Federated 
Interoperable Relational Algebra (FIRA). FIRA has many desirable properties such as 
compositionality, closure, a deterministic semantics, a modest complexity, support for 
nested queries, a subalgebra equivalent to canonical Relational Algebra (RA), and 
robustness under certain classes of schema evolution. Beyond this, FIRA queries are 
capable of producing fully dynamic output schemas, where the number of ... 
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Database management systems will continue to manage large data volumes. Thus, 
efficient algorithms for accessing and manipulating large sets and sequences will be 
required to provide acceptable performance. The advent of object-oriented and extensible 
database systems will not solve this problem. On the contrary, modern data models 
exacerbate the problem: In order to manipulate large sets of complex objects as 
efficiently as today's database systems manipulate simple records, query-processi ... 

Keywords: complex query evaluation plans, dynamic query evaluation plans, extensible 
database systems, iterators, object-oriented database systems, operator model of 
parallelization, parallel algorithms, relational database systems, set-matching algorithms, 
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The Vesta parallel file system is designed to provide parallel file access to application 
programs running on multicomputers with parallel I/O subsystems. Vesta uses a new 
abstraction of files: a file is not a sequence of bytes, but rather it can be partitioned into 
multiple disjoint sequences that are accessed in parallel. The partitioning— which can also 
be changed dynamically— reduces the need for synchronization and coordination during 
the access. Some control over the layout ... 

Keywords: data partitioning, parallel computing, parallel file system 
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Flash memory is a type of electrically-erasable programmable read-only memory 
(EEPROM). Because flash memories are nonvolatile and relatively dense, they are now 
used to store files and other persistent objects in handheld computers, mobile phones, 
digital cameras, portable music players, and many other computer systems in which 
magnetic disks are inappropriate. Flash, like earlier EEPROM devices, suffers from two 
limitations. First, bits can only be cleared by erasing a large block of memory. S ... 
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Full text available: Iffi pdf(435.89 KB) Additlonal Information: full citation , a bstract , references , citings, index 
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We provide a principled extension of SQL, called SchemaSQL, that offers the capability of 
uniform manipulation of data and schema in relational multidatabase systems. We 
develop a precise syntax and semantics of SchemaSQL in a manner that extends 
traditional SQL syntax and semantics, and demonstrate the following. (1) SchemaSQL 
retains the flavor of SQL while supporting querying of both data and schema. (2) It can be 
used to transform data in a database in a structure substa ... 

Keywords: Information integration, SchemaSQL, multidatabase systems, restructuring 
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Our growing reliance on online services accessible on the Internet demands highly 
available systems that provide correct service without interruptions. Software bugs, 
operator mistakes, and malicious attacks are a major cause of service interruptions and 
they can cause arbitrary behavior, that is, Byzantine faults. This article describes a new 
replication algorithm, BFT, that can be used to build highly available systems that tolerate 
Byzantine faults. BFT can be used in practice to implement re ... 

Keywords: Byzantine fault tolerance, asynchronous systems, proactive recovery, state 
machine replication, state transfer 
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Randi Rost 

August 2004 ACM SIGGRAPH 2004 Course Notes SIGGRAPH '04 

Publisher: ACM Press 

Full text available: ^ pdf(7.39 MB ) Additional Information: full citation , abstract 

Real-time procedural shading was once seen as a distant dream. When the first version of 
this course was offered four years ago, real-time shading was possible, but only with one- 
of-a-kind hardware or by combining the effects of tens to hundreds of rendering passes. 
Today, almost every new computer comes with graphics hardware capable of interactively 
executing shaders of thousands to tens of thousands of instructions. This course has been 
redesigned to address today ! s real-time shading capabili ... 

A taxonomy of Data Grids for distributed data sha ring, mana g ement, and processing 
Srikumar Venugopal, Rajkumar Buyya, Kotagiri Ramamohanarao 
June 2006 ACM Computing Surveys (CSUR) f volume 38 issue l 
Publisher: ACM Press 

Full text available: ^ pdf(1.70 MB) Additional Information: full citation , abstract , references, index terms 

Data Grids have been adopted as the next generation platform by many scientific 
communities that need to share, access, transport, process, and manage large data 
collections distributed worldwide. They combine high-end computing technologies with 
high-performance networking and wide-area storage management techniques. In this 
article, we discuss the key concepts behind Data Grids and compare them with other data 
sharing and distribution paradigms such as content delivery networks, peer-to-peer n ... 

Keywords: Grid computing, data-intensive applications, replica management, virtual 
organizations 



9 Modeling the storage architectures of commercial database systems 
D. S. Batory 

December 1985 ACM Transactions on Database Systems (TODS), volume 10 issue 4 
Publisher: ACM Press 

Full text available: 151 pdf(4.46 MB) Addit ' onal Information: full citation , abstract , references , citings, index 
• ^ : terms , review 

Modeling the storage structures of a DBMS is a prerequisite to understanding and 
optimizing database performance. Previously, such modeling was very difficult because 
the fundamental role of conceptual-to-internal mappings in DBMS implementations went 
unrecognized. In this paper we present a model of physical databases, called the 
transformation model, that makes conceptual-to-internal mappings explicit. By exposing 
such mappings, we show that it is possible to model the storage ... 
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10 Course and exercise sequencing usin g metadata in adaptive h y permedia learnin g 

^ s ystems 

^ Stephan Fischer 

March 2001 Journal on Educational Resources in Computing (JERIC) 

Publisher: ACM Press 

Full text available' f£\ pdfd 15 01 KB) Additional Information: full citation , abstract , references , citings, index 
■ lajjj_j terms , review 

In the last few years the (semi-) automatic sequencing of course material has become an 
important research issue, particularly the standardization of metadata for educational 
resources. Sequencing can help to generate hypermedia documents which, at their best 
match the learner's needs. To perform (semi-) automatic course sequencing, a knowledge 
library as well as modular resources can be used. Both must be described by metadata. ... 

Keywords: adaptive hypermedia systems, hypermedia learning, knowledge engineering, 
sequencing of course material 



11 HFS: a performance-oriented flexible file system based on build ing-block 

^ compositions 

^ Orran Krieger, Michael Stumm 

August 1997 ACM Transactions on Computer Systems (TOCS), volume 15 issue 3 

Publisher: ACM Press 

Full text available* 113 pdf(383 87 KB) Adcl ' t ' ona ' Information: full citation , abstract , references , citings , index 
• Ld-^— 1 : terms, review 

The Hurricane File System (HFS) is designed for (potentially large-scale) shared-memory 
multiprocessors. Its architecture is based on the principle that, in order to maximize 
performance for applications with diverse requirements, a file system must support a wide 
variety of file structures, file system policies, and I/O interfaces. Files in HFS are 
implemented using simple building blocks composed in potentially complex ways. This 
approach yields great flexibility, allowing an application ... 

Keywords: customization, data partitioning, data replication, flexibility, parallel 
computing, parallel file system 



12 A model of multimedia information retrieval 
^ Carlo Meghini, Fabrizio Sebastiani, Umberto Straccia 
V September 2001 Journal of the ACM ( JACM), volume 48 issue 5 
Publisher: ACM Press 

Full text available: f a pdf(5.69 MB) Additional Information: full citation , abstract, references, citings, index 

terms 

Research on multimedia information retrieval (MIR) has recently witnessed a booming 
interest. A prominent feature of this research trend is its simultaneous but independent 
materialization within several fields of computer science. The resulting richness of 
paradigms, methods and systems may, on the long run, result in a fragmentation of 
efforts and slow down progress. The primary goal of this study is to promote an 
integration of methods and techniques for MIR by contributing a conceptual model ... 

Keywords: Description logics, fuzzy logics, multimedia information retrieval 
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13 Model-driven development of Web a p plications: the AutoWeb system 
^ Piero Fraternali, Paolo Paolini 

V 7 October 2000 ACM Transactions on Information Systems (TOIS), Volume 18 issue 4 
Publisher: ACM Press 

Full text available: pdf( 6.94 MB) Additional Information: full citation, abstract, references , cjtjngs, index 
' La terms 

This paper describes a methodology for the development of WWW applications and a tool 
environment specifically tailored for the methodology. The methodology and the 
development environment are based upon models and techniques already used in the 
hypermedia, information systems, and software engineering fields, adapted and blended 
in an original mix. The foundation of the proposal is the conceptual design of WWW 
applications, using HDM-lite, a notation for the specification of structure, nav ... 

Keywords: HTML, WWW, application, development, intranet, modeling 



14 The Conquest file system: Better performance through a disk/persistent-RAM hybrid jjgj 
design 

An-i Andy Wang, Geoff Kuenning, Peter Reiher, Gerald Popek 
August 2006 ACM Transactions on Storage (TOS), volume 2 issue 3 
Publisher: ACM Press 

Full text available: ^ pd f(1. 34 MB) Additional Information: full citation, abstract, references , index terms 

Modern file systems assume the use of disk, a system-wide performance bottleneck for 
over a decade. Current disk caching and RAM file systems either impose high overhead to 
access memory content or fail to provide mechanisms to achieve data persistence across 
reboots.The Conquest file system is based on the observation that memory is becoming 
inexpensive, which enables all file system services to be delivered from memory, except 
for providing large storage capacity. Unlike caching, Con ... 

Keywords: Persistent RAM, file systems, performance measurement, storage 
management 




15 Improving storage system availability with D-GRA I D 

Muthian Sivathanu, Vijayan Prabhakaran, Andrea C. Arpaci-Dusseau, Remzi H. Arpaci- 
Dusseau 

May 2005 ACM Transactions on Storage (TOS), volume l issue 2 
Publisher: ACM Press 

Full text available: ^ pdf(700.30 KB) Additional Information: full citation, abstract , reference s, index terms 

We present the design, implementation, and evaluation of D-GRAID, a gracefully 
degrading and quickly recovering RAID storage array. D-GRAID ensures that most files 
within the file system remain available even when an unexpectedly high number of faults 
occur. D-GRAID achieves high availability through aggressive replication of semantically 
critical data, and fault-isolated placement of logically related data. D-GRAID also recovers 
from failures quickly, restoring only live file system data to a h ... 

Keywords: Block-based storage, Disk array, RAID, fault isolation, file systems, smart 
disks 
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17 Ext3cow: a time-shifting file system for regulatory compliance 
Zachary Peterson, Randal Burns 

May 2005 ACM Transactions on Storage (TOS), volume i issue 2 
Publisher: ACM Press 

Full text available: l f|l) pdf(44 3.01 KB) Additional Information: full citation , abstract , references , index terms 

The ext3cow file system, built on the popular ext3 file system, provides an open-source 
file versioning and snapshot platform for compliance with the versioning and audtitability 
requirements of recent electronic record retention legislation. Ext3cow provides a time- 
shifting interface that permits a real-time and continuous view of data in the past. Time- 
shifting does not pollute the file system namespace nor require snapshots to be mounted 
as a separate file system. Further, ext3cow is i ... 

Keywords: Versioning file systems, copy-on-write 





18 Cheap reco ver y: a ke y to self-mana ging state 
dtK Andrew C. Huang, Armando Fox 

February 2005 ACM Transactions on Storage (TOS), volume l issue l 

Publisher: ACM Press 

Full text available: ^pdf(1.24 MB) Additional Information: full citation , abstract , references , index terms 

Cluster hash tables (CHTs) are key components of many large-scale Internet services due 
to their highly-scalable performance and the prevalence of the type of data they store. 
Another advantage of CHTs is that they can be designed to be as self-managing as a 
cluster of stateless servers. One key to achieving this extreme manageability is reboot- 
based recovery that is predictably fast and has modest impact on system performance 
and availability. This "cheap" recovery mechanism simplifies manageme ... 

Keywords: Cluster hash table, manageability, quourum replication, storage systems 
design 
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May 2006 ACM Transactions on Storage (TOS), volume 2 issue 2 
Publisher: ACM Press 

Full text available: *g pdf(260.40 KB ) Additional Information: full citation , abstract, references , index terms 

Developing file systems from scratch is difficult and error prone. Using layered, or 
stackable, file systems is a powerful technique to incrementally extend the functionality of 
existing file systems on commodity OSes at runtime. In this article, we analyze the 
evolution of layering from historical models to what is found in four different present day 
commodity OSes: Solaris, FreeBSD, Linux, and Microsoft Windows. We classify layered file 
systems into five types based on their functionality and ... 

Keywords: I/O manager, IRP, Layered file systems, VFS, extensibility, stackable file 
systems, vnode 
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The 3-Dimensional Information Space (3DIS) is an extensible object-oriented framework 
for information management. It is specifically oriented toward supporting the database 
requirements for data-intensive information system applications in which (1) information 
objects of various levels of abstraction and modalities must be accommodated, (2) 
descriptive and structural information (metadata) is rich and dynamic, and (3) users who 
are not database experts must be able to design, manipulate, a ... 
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